About me

My name is Shelley (Yiyan). I'm a freelance data scientist with a Master's degree in Math and Statistics, focusing on statistical analysis and machine learning. I have completed numerous hands-on data science projects that turned data into valuable outcomes.

Data science is my passion. I can help you with every step of the data science workflow, including web scraping, data cleaning, data visualization, modeling, model deployment, and documentation.

To carry out these steps, I use a wide range of languages and tools, including Python, R, Tableau, Excel, SQL, AWS (including SageMaker), SAS, and SPSS, along with their packages and libraries.

I’m experienced in the following areas:

  • Machine Learning
    • Supervised Learning (linear models, Lasso, Ridge, SVM, KNN, decision tree, random forest, XGBoost and many more, for regression and classification problems)
    • Unsupervised Learning (clustering and principal component analysis)
  • Deep Learning
    • NLP and Sequence Models (RNN, LSTM, DeepAR, and Transformers such as BERT)
    • Computer Vision
  • Exploratory Data Analysis (data cleaning, data visualization and feature engineering)
  • Probability and Statistics (statistical tests such as ANOVA, correlation, etc.)
  • Sampling (random, systematic, convenience, cluster, and stratified)
  • Survival Analysis (Kaplan-Meier plot, log-rank test, Cox proportional hazards regression, etc.)
  • Data Collection (e.g., web scraping)

I have earned Coursera certificates for the following deep learning courses:

  • Neural Networks and Deep Learning
  • Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization
  • Structuring Machine Learning Projects
  • Sequence Models

Find me